# Bilingual Support (Chinese-English)

GLM Z1 9B 0414 GGUF
MIT
GLM-Z1-9B-0414 is a 9B-parameter open-source model from the GLM family, specializing in mathematical reasoning and general task capabilities, excelling in resource-constrained scenarios.
Large Language Model Supports Multiple Languages
G
unsloth
2,258
5
Wan2.1 T2V 14B
Apache-2.0
Wan2.1 is an open and advanced large-scale video generation model with top-tier performance, capable of running on consumer-grade GPUs and excelling in multitask processing.
Text-to-Video Supports Multiple Languages
W
wan-community
17
0
Qwen2.5 VL 32B Instruct GGUF
Apache-2.0
Qwen2.5-VL-32B-Instruct is a multimodal vision-language model that supports joint understanding and generation tasks for both images and text.
Image-to-Text English
Q
samgreen
25.59k
6
Qwen2.5 VL 3B Instruct GPTQ Int3
Apache-2.0
The GPTQ-Int3 quantized version of Qwen2.5-VL-3B-Instruct, suitable for multimodal image-text processing tasks with reduced VRAM usage and faster inference speed.
Image-to-Text Transformers Supports Multiple Languages
Q
hfl
60
1
Qwen2.5 VL 7B Instruct GPTQ Int3
Apache-2.0
This is an unofficial GPTQ-Int3 quantized version based on the Qwen2.5-VL-7B-Instruct model, suitable for multimodal image-text-to-text tasks.
Image-to-Text Transformers Supports Multiple Languages
Q
hfl
577
1
Wiroai Finance Qwen 1.5B
Apache-2.0
Financial domain-specific language model based on Qwen architecture, fine-tuned with 500k+ financial instructions
Large Language Model Transformers
W
WiroAI
886
16
Bge Reranker Large Q4 K M GGUF
MIT
This model is converted from BAAI/bge-reranker-large into GGUF format for reranking tasks, supporting both Chinese and English.
Text Embedding Supports Multiple Languages
B
DrRos
164
1
Llava Video 7B Qwen2
Apache-2.0
The LLaVA-Video model is a 7B-parameter multimodal model based on the Qwen2 language model, specializing in video understanding tasks and supporting 64-frame video input.
Video-to-Text Transformers English
L
lmms-lab
34.28k
91
Minicpm Llama3 V 2 5 GGUF
MiniCPM-Llama3-V-2_5 is a multimodal visual question answering model based on the Llama3 architecture, supporting both Chinese and English interactions.
Text-to-Image Supports Multiple Languages
M
gaianet
112
3
Llama 3 Chinese 8b Instruct V3
Apache-2.0
Llama-3-Chinese-8B-Instruct-v3 is a Chinese instruction model fine-tuned from multiple hybrid models, suitable for dialogue, Q&A, and similar scenarios.
Large Language Model Transformers Supports Multiple Languages
L
hfl
468
62
360VL 70B
Apache-2.0
360VL is an open-source large multimodal model developed based on the LLama3 language model, featuring powerful image understanding and bilingual text support capabilities.
Text-to-Image Transformers Supports Multiple Languages
3
qihoo360
103
10
360VL 8B
Apache-2.0
360VL is a multimodal model developed based on the LLama3 language model, featuring powerful image understanding and bilingual dialogue capabilities.
Text-to-Image Transformers Supports Multiple Languages
3
qihoo360
22
13
Yi VL 6B Hf
Other
Yi-VL-6B is a multimodal vision-language model developed by 01-AI, supporting both Chinese and English, suitable for tasks like visual question answering.
Image-to-Text Transformers Supports Multiple Languages
Y
BUAADreamer
55
2
Llama3 8B Slerp Biomed Chat Chinese
A Chinese biomedical and general chat model based on Llama3-8B, merging two specialized models using the slerp method
Large Language Model Transformers Supports Multiple Languages
L
shanchen
2,599
1
Minicpm V 2
MiniCPM-V 2.0 is a powerful multimodal large language model designed for efficient terminal deployment, built upon SigLip-400M and MiniCPM-2.4B and connected via a perceptual resampler.
Text-to-Image Transformers Supports Multiple Languages
M
openbmb
9,097
461
Haruhi Dialogue Speaker Extract Qwen18
Apache-2.0
A dialogue extraction model fine-tuned based on qwen-1.8, capable of batch extracting summaries and dialogues from novel excerpts
Text Generation Transformers Supports Multiple Languages
H
silk-road
17
3
Tiny Llava V1 Hf
Apache-2.0
TinyLLaVA is a compact large-scale multimodal model framework focused on vision-language tasks, featuring small parameter size yet excellent performance.
Image-to-Text Transformers Supports Multiple Languages
T
bczhou
2,372
57
Causallm 7B DPO Alpha GGUF
A 7B-parameter large language model based on Llama 2 architecture, optimized through DPO training, supporting Chinese and English text generation
Large Language Model Supports Multiple Languages
C
tastypear
367
36
Zhilu 13B Instruct
Apache-2.0
ZhiLu is a financial large language model developed based on Chinese Alpaca2-13B, achieving capability leaps through massive incremental pre-training with Chinese and English corpora and high-quality instruction data alignment, with a focus on enhancing performance in the financial domain.
Large Language Model Transformers
Z
SYSU-MUCFC-FinTech-Research-Center
26
3
Glm 2b
GLM-2B is a general-purpose language model pre-trained with autoregressive blank filling objectives, supporting various natural language understanding and generation tasks.
Large Language Model Transformers English
G
THUDM
60
16
Hupd T5 Small
This model is a fine-tuned T5-small model based on the HUPD dataset, specifically designed for patent text summarization tasks.
Text Generation Transformers English
H
HUPD
21
3
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase